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Abstract 

Motivated by peer-to-peer (P2P) networks and content delivery applications, we study Capacitated 
Selfish Replication (CSR) games, which involve nodes on a network making strategic choices regarding 
the content to replicate in their caches. Selfish replication games were introduced in ||6], who analyzed 
the uncapacitated case leaving the capacitated version as an open direction. 

In this work, we study pure Nash equilibria of CSR games with an emphasis on hierarchical net- 
works, which have been extensively used to model the communication costs of content delivery and P2P 
systems. The best result from previous work on CSR games for hierarchical networks f^PTTl is the ex- 
istence of a Nash equilibrium for a (slight generalization of a) 1-level hierarchy when the utility function 
is based on the sum of the costs of accessing the replicated objects in the network. Our main result is an 
exact polynomial-time algorithm for finding a Nash Equilibrium in any hierarchical network using a new 
technique which we term "fictional players". We show that this technique extends to a general frame- 
work of natural preference orders, orders that are entirely arbitrary except for two natural constraints - 
"Nearer is better" and "Independence of irrelevant alternatives". This axiomatic treatment captures a vast 
class of utility functions and even allows for nodes to simultaneously have utility functions of completely 
different functional forms. 

Using our axiomatic framework, we next study CSR games on arbitrary networks and delineate the 
boundary between intractability and effective computability in terms of the network structure, object 
preferences, and the total number of objects. In addition to hierarchical networks, we show the existence 
of equilibria for general undirected networks when either object preferences are binary or there are two 
objects. For general CSR games, however, we show that it is NP-hard to determine whether equilibria 
exist. We also show that the existence of equilibria in strongly connected networks with two objects 
and binary object preferences can be determined in polynomial time via a reduction to the well-studied 
even-cycle problem. Finally, we introduce a fractional version of CSR games (F-CSR) with application 
to content distribution using erasure codes. We show that while every F-CSR game instance possesses 
an equilibrium, finding an equilibrium in an F-CSR game is PPAD-complete. 
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Anand. This work was also partially supported by NSF grants CCF-0635119 and CNS-0915985. 



1 Introduction 

Consider a P2P movie sharing service where you need to decide which movies to store locally, given your 
limited disk space, and which to obtain from your friends. Note that your decisions affect those of your 
friends, who in turn take actions that affect you. A natural question arises: what is the prognosis for you 
and your network of friends in terms of the stability of your movie collections and the satisfaction you will 
derive from them? Similarly, in the brave new wireless world of 4G you will not only be a consumer of 
different apps, you (your personal communications and computing device) will also be a provider of apps to 
others around you. And the question arises: could this lead to a situation of endless churn (in terms of what 
apps to store) or could there be an equilibrium? 

In this paper, we study Capacitated Selfish Replication (CSR) Games, which provide an abstraction of 
the above scenarios. These are games in which the strategic agents, or players, are nodes in a network. The 
nodes have object preferences as well as bounded storage space - caches - in which they can store copies 
of the content. Each node cooperates with other nodes by serving their requests to access objects stored in 
its cache. However, the set of objects that a node chooses to store in its cache is entirely based on its own 
utility function and where objects of interest have been stored in the network. 

Such a game-theoretic framework was first introduced by Chun et al lIU, who analyzed pure Nash equi- 
libria in a setting with storage costs but no cache capacities, and left the capacitated version as an open 
direction. Recent work on CSR games has focused on hierarchical networks, which have been extensively 
used to model the communication costs of content delivery and P2P systems. (For instance, see the seminal 
work of IIT4I that uses the ultrametric model for content delivery networks and the work of of l[22l[T6l[T7ll32l 
on cooperative caching in hierarchical networks.) The best result from previous work on CSR games for 
hierarchical networks [21, 27 1 is the existence of a Nash equilibrium for (a slight generalization of) a one- 
level hierarchical network using the sum utility function, i.e., when the utility of each node is based on a 
weighted sum of the cost of accessing the objects. 

1.1 Our results 

This paper studies the existence and computability of Nash equilibria for several variants of CSR games, 
with a particular focus on hierarchical networks. As with earlier studies (611211 |27l[ll, we focus on the case 
where all pieces of content have the same size; note that otherwise even computing the best response of a 
player (node) is a generalization of the well-known knapsack problem and is NP-hard. 

• Our main result is a polynomial-time algorithm for finding a Nash equilibrium for CSR games in 
any hierarchical network, thus resolving the question left open by |[T9ll20llT7l . Our algorithm, pre- 
sented in SectionJBJ is based on a new technique that we call the method of "fictional player^' where 
we introduce and eliminate fictional players iteratively in a controlled fashion, maintaining a Nash 
equilibrium at each step, until the end when we have the desired equilibrium for the entire network 
(without any fictional players). 

The above result is presented specifically in the context of the sum utility function to elucidate the 
technique of fictional players. We then abstract the central requirements for our proof technique and develop 
a general axiomatic framework which allows us to extend our results to a large class of utility functions. 

• We present, in Section [4j a general framework for CSR games involving utility preference relations 
and node preference orders. Rather than specifying a numerical utility assigned by each node to each 
placement of objects, we only require that the preference order each node has on object placements 
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Table 1: Existence and computability of equilibria in CSR games. Each cell (other than in the first row or the first 
column) first indicates whether equilibria always exist in the particular sub-class of CSR games. If equilibria always 
exist, then the cell next indicates the complexity of determining an equilibrium; otherwise, it indicates the complexity 
of determining whether equilibria exist for a given instance. The relevant subsection is given in parentheses. 



satisfy two natural constraints of Monotonicity (or "Nearer is better") and Consistency (or "Indepen- 
dence of irrelevant alternatives"). This axiomatic treatment captures a vast class of utility functions 
and even allows for nodes to simultaneously have utility functions of completely different functional 
forms. 

We extend our result for hierarchical networks to the broader class of utilities allowed by the axiomatic 
framework, and then study general CSR games obtained by considering different network structures 
(directed or undirected) and different forms of object preferences (binary or general). We delineate 
the boundary between intractability and effective computability of equilibria in terms of the network 
structure, object preferences, and the total number of objects. These results are presented in Sections[5] 
and[6]and summarized in Table[T] Notable results include: (1) the existence of equilibria for undirected 
networks with two objects that also utilizes the technique of fictional players, (2) the existence of 
equilibria for undirected networks when object preferences are binary, and (3) the equivalence of 
finding equilibria in CSR games with two objects and binary object preferences to the well-studied 
even-cycle problem ||29l . 



Our last set of results concerns fractional CSR (F-CSR) games, where each node is allowed to store 
fractions of objects. In our framework, a node can satisfy an object access request by retrieving any set of 
fractions of the object as long as these fractions sum to at least one. As we discuss in Section [TJ a natural 
implementation of this framework is via erasure codes (e.g., using the Digital Fountain approach B1,[3ll). 

• We show that F-CSR games always have equilibria, and the problem of finding an equilibrium is in 
PPAD. We also show, however, that finding equilibria is PPAD-hard even for a sum-of-distances 
utility function. 



1.2 Related work 

In the last decade there has been a tremendous flowering of research at the intersection of game theory 
and computer science [24|. In a seminal paper ||26ll Papadimitriou laid the groundwork for algorithmic 
game theory by introducing syntactically defined subclasses of FNP with complete problems, PPAD being 
a notable such subclass. Recently, in a major breakthrough 2-player Nash Equilibrium was shown to be 
PPAD-complete LZ^. The term PPAD-complete is coming to occupy a role in algorithmic game theory 
analogous to the term NP-complete in combinatorial optimization ifTTl . 

Selfish caching games were introduced in [6] who considered the uncapacitated case where nodes could 
store more pieces of content by paying for the additional storage. We believe that limits on cache-capacity 
model an important real-world restriction and hence our focus on the capacitated version which was left as 



an open direction by |l6l. Special cases of the integral version of CSR games have been studied. In |s2T1|. 
Nash equilibria were shown to exist for when nodes are equidistant from one another and a special server 
holds all objects. [271 slightly extends [21] to the case where special servers for different objects are at 
different distances. Our results generalize and completely subsume all these prior cases of CSR games. The 
Market sharing games defined by |[T2ll also consider caches with capacity, but are of a very special kind; 
unlike CSR games, market sharing games are a special case of congestion games. In this work we focus 
primarily on equilibria and our general axiomatic framework has the flavor of similar frameworks from the 
theory of social choice [2|; in this sense, we deviate from prior work [,9, 8] that is focused on the price of 
anarchy ifTSl . 

There has been considerable research on capacitated caching, viewed as an optimization problem. Vari- 
ous centralized and distributed algorithms have been presented for different networks in lin[3l l22l[T6l[33l . 

2 A basic model for CSR games 

We consider a network consisting of a set V of nodes labeled 1 through n = |l^| sharing a collection O of 
unit-size objects. For any i and j in V, let dij denote the cost incurred at i for accessing an object at j; we 
refer to d as the access cost function. We say that j is node i's nearest node in a set S of nodes if j is in 
S and dij < dik for all k in S. We say that the given network is undirected if d is symmetric; that is, if 
dij = dji for all i, j in V. We call an undirected network hierarchical if the access cost function forms an 
ultrametric; that is, if dik < maxjdjj, djk} for all i, j, k € V. 

Each node i has a cache to store a certain number of objects. The placement at a node i is simply the 
set of objects stored at i. The strategy set of a given node is the set of all feasible placements at the node. 
A global placement is any tuple (Pj : i ^ V), where Pj C O represents a feasible placement at node i. For 
convenience, we use P-i to denote the collection {Pj : j ^ V \ {i}), thus often using P = {Pi, P^i) to 
refer to a global placement. We also assume that V includes a (server) node that has the capacity to store all 
objects. This ensures that at least one copy of every object is present in the system; this assumption can be 
made without loss of generality since we can set the access cost of every node to this server to be arbitrarily 
large. 

CSR Games. In our game-theoretic model, each node attaches a utility to each global placement. We 
assume that each node i has a weight ri{a) for each object a representing the rate at which i accesses a. 
We define the sum utility function Us{i) as follows: Us{i){P) = — ^^.^o ''«('^) ' ^iajp.a)^ where o"j(P, a) 
is i's nearest node holding a in P. 

A CSR game is a tuple {V, O, d, {rj}). Our focus is on pure Nash equilibria (henceforth, simply equi- 
libria) of the CSR games we define. An equilibrium for a CSR game instance is a global placement P such 
that for each i ^ V there is no placement Qi such that Us{i){P) > Us{i){Q). 

Unit cache capacity. In this paper, we assume that all objects are of identical size. Under this assumption, 
we now argue that it is sufficient to consider the case where each node's cache holds exactly one object. 
Consider a set V of nodes in which the cache of node i can store q objects. Let V' denote a new set of 
nodes which contains, for each node i in V, new nodes ii,i2, ■ ■ ■ , ia, i-S-, one new node for each unit of 
the cache capacity of i. We extend the access cost function as follows: dj^j^. = dji for all 1 < ^ < Cj, 
l<k<Ci. 

We consider an obvious onto mapping / from placements in V' to those in V. Given placement P' for 
V', we set f{P') = P where Pi = Ui<fc<Cj-P/ . This mapping ensures that Us{i){P') = Us{i){P), giving 
us the desired reduction. Thus, in the remainder of the paper, we assume that every node in the network 
stores at most one object in its cache. 



3 Hierarchical networks 

In this section, we give a polynomial-time construction of equilibria for CSR games on hierarchical net- 
works. Any hierarchical network can be represented by a tree T whose set of leaves is the node set V and 
every internal node v has a label £{v) such that (a) if v is an ancestoinof w in T, then i{v) > £{w), and (b) 
for any i,j in V, dij is given by ^(lca(i,j)), where lca(i, j) denotes the least common ancestor of nodes i 
andjimiTll. 

Fictional players. In order to present our algorithm, we introduce the notion of a fictional player. For an 
object a, Si fictional a-player is a new node that stores a in any equilibrium; for any fictional a-player £, 
r£{a) is 1 and r£(/3) is for any /3 / a. Each fictional player is introduced as a leaf in the current hierarchy; 
the exact locations in the hierarchy are determined by our algorithm. The access cost function is naturally 
extended to the fictional players using the hierarchy and the labels of the internal nodes. In the following, 
we use "node" to refer to both the elements of V and fictional players. 

A preference relation. The hierarchical network and the weights that nodes have for different objects 
induce, for each node i, a natural preorder Zlj among elements of O x Ai, where Ai is the set of proper 
ancestors of i in T. Specifically, we define {a,v) Dj {P,w) whenever ri{a) ■ £{v) > ri{P) • i{w). We 
can now express the best response of any player directly in terms of these preference relations. We define 

fii{P) = (a, v) where Pi = {a} and v is lca(i, ai{P^i, a)), where ai{P^i, a) denotes i's nearest node in 
the set of nodes holding a in P_j. 

Lemma 1. A best response Pi of a node i for a placement P-i ofV\ {%} is {a} where a maximizes 

(7, lca[i, ai{P-i, 7))), over all objects 7, according to Zlj. 

Proof. For a given placement P with Pi = {a}, C/s(z)(P) equals — ^,rj(7)£(lca(i, crj(P_i, 7))), which 
can be rewritten as — (^ Qri(7)^(lca(i,cri(P_i,7)))) + ri{a) ■ i{lca.{i, ai{P-i, a))). Thus, {a} is a best 
response to P_j if and only if a maximizes ri{'y) ■ £(lca(z, <Tj(P_j, 7)) over all objects 7. The desired claim 
follows from the definition of Zli- D 

The algorithm. We introduce several fictional players at the start of the algorithm. We maintain the invariant 
that the current global placement is an equilibrium in the current hierarchy. As the algorithm proceeds, the 
set of fictional players and their locations change as we remove existing fictional players or add new ones. 
On termination, there are no fictional players leaving us with a desired equilibrium. Let Wt and P* denote 
the set of fictional players and equilibrium, respectively, at the start of step t of the algorithm. 

Initialization. We add, for each object a and for each internal node t> of T, a fictional a-player as a leaf 
child of v; this constitutes the set Wq. The initial equilibrium P^ is defined as follows: for each fictional 
a-player i, we have P^ = {a}; each node iinV plays its best response. Clearly, each fictional player is in 
equilibrium, by definition. Furthermore, for every a, every imV has a sibling fictional a-player. Thus, the 
best response of every i in y is independent of the placement of nodes mV \{i], implying that P" is an 
equilibrium. 

Step t of algorithm. Fix an equilibrium P* for the node set V UWt. liWt is empty, then we are done. 
Otherwise, select a node j in Wt. Let Pj = {a}, and let Hj{P^) = (a, v). Let S denote the set of all nodes 
i £ V such that (a, v) Zlj fii{P^). We now describe how to compute a new set of fictional players Wt+i and 
a new global placement P*+i such that P*+i is an equilibrium for V U Wt+i. We consider two cases. 

• 5" is empty: Remove the fictional player j from Wt and the hierarchy, and leave the placement in the 
remaining nodes as before. Thus Wt+i = Wt — {j} and P*+i is the same as P* except that P*^^ is 



^We adopt the convention that each node is both descendant and ancestor of itself. 



S is nonempty: Select a node i in 5 such that lca(i, j) is lowest among all nodes in S. Let P/ = {/3}. 
We set P^^^ = {a}, remove the fictional a-player j from Wt, and add a new fictional /3-player i 
as a leaf sibling of i in T; i.e., -P/^^ = {/3}. For every other node j, set P*^^ = P*. Finally, set 
P7t+i = {Wt U {fc}) \ {j}. 







Figure 1: Illustrating the algorithm for hierarchical networks; the square is a node i in V^ holding object /3, and the 
hexagon is a fictional a-player 

Lemma 2, Consider step t of the algorithm. IfP* is an equilibrium for VUWt, then the following statements 
hold. 

1. For every node kinVU Wt+i, Pj^ is a best response to P^ . 

2. For every node k inV L) Wt+i, /Xfc(P*+^) ^^ l^k{P*)- 

3. We have \Wt+i\ < \Wt\. Furthermore, either |Wj+i| < \Wt\ or there exists a node i in V such that 

fii{P'+') D. fi,{P'). 

Proof. Let a, v, S, i, and j be as defined in step t of the algorithm above, and illustrated in Figure [T] We 
first establish statements 1 and 2 of the lemma. Let k be any node in y U Wt+i. Consider first the case 
where lca(fc, j) is an ancestor of v (i.e., k is not in the subtree rooted at the child uofv that contains j). For 
any object 7, we have ak{P^_X^,j) = crk{Ptk,l) and P^+^ = P^. It thus follows that /ife(P*+^) = /ife(P*), 
implying statement 2 for k. Since P* is in equilibrium, statement 1 also holds for k. 

We next establish statements 1 and 2 for any node k where lca{k,j) is a proper descendant of v (i.e., k 
is in the subtree rooted at the child u of v that contains j). We consider two cases. The first case is where 
S is empty. In this case, the fictional a-player j is removed; thus j is not in Wt+i. Furthermore, there is no 
copy of a in the subtree rooted at u. Since no object other than a is created or removed in this case, we have 
^k{Pf,l) = ^fc(^!fe, 7) for 7 / «■ We also have lca(fc, ak{P'_l\a)) = v and /Zfc(P*+i) = ^,k{Pt), the 
latter establishing statement 2 for k. Since S is empty, iJ.k{P*) 3fc (a, v). It follows from Lemmaflland the 
fact that Pj, is in equilibrium that P^^^ is a best response against P^^/, establishing statement 1 for k. 

The second case is where S is not empty. Let i be as defined above, i.e., i is a node in S such that 
lca(z, j) is lowest among all nodes in S. Let x denote lca(z, j). Let P/ be equal to {/3}, where /3 7^ a. 
By the algorithm, we have P^^ = {a}. Let k ^ i he a node in the subtree rooted at u. For any 7 / 
a, cTfe(P!+/,7) = cJkiPtWl)- Since P*+i = P* / {«}, we have /ifc(P*+') = l^k{P'), establishing 
statement 2 for k. For node i, we have fii{P^^^) = (a, v) Zlj ^i{P^), establishing statement 2 for i. 

It remains to establish statement 1 for any node k in the subtree rooted at u. We again separate into two 
cases. Let y be the child of x that is an ancestor of j (see Figure [TJc)). In the first case, we let k be in the 



subttee rooted at y. Then, by our choice of i, we have 

^fc(P*+') 3fc (a,f) 3fc (a,x) = (a,cTfc(Pit''«))' 

which, by Lemma [T] implies that statement 1 holds for k. In the second case, we let k be in the subtree 
rooted at u but not in the subtree rooted at y. Again, cjfc(P^'^^, 7) = (Tfc(Pl^, 7) for 7 / a. And for a we 
have 

(a,lca(fc,afe(P!+\a))) = (a,lca(fc,i)) Dfc (a,x) 3fc /xfc(P*) = /ifc(P*+i), 

establishing statement 1 for k using Lemma [T] 

We finally establish statement 3 of the lemma. The fact |Wt+i| < |VFj| is immediate from the definition 
of step t of the algorithm. When S is empty, | VFi+i| < | VFt| since a fictional player is deleted. When S is 
nonempty, we have shown above that ^i{P*^^) Dj fii{P^), thus completing the proof for statement 3 and 
for the whole lemma. D 

Theorem 3. For hierarchical node preferences, an equilibrium can be found in polynomial time. 

Proof. It is immediate from the definition of the algorithm and Lemma |2] that at termination, the algorithm 
returns a valid equilibrium. It remains to show that our algorithm terminates in polynomial time. Consider 
the potential given by the sum of | Wi| and the sum, over all i, of the position of iii{P^) in the preorder Zlj. 
The term \Wq\ is at most mm, where n is \V\ (which is at least the number of internal nodes) and m is the 
number of objects. Furthermore, since |0 x /| is at most nm, the initial potential is at most nm + ri^m. By 
Lemma[2j the potential decreases by at least one in each step of the algorithm. Thus, the number of steps of 
the algorithm is at most nm + 11? m. 

We now show that each step of the algorithm can be implemented in polynomial time. The initialization 
consists of adding the 0{nm) fictional players and computing the best response for each node i in V; 
the latter task involves, for each k in V, comparing at most m placements (one for each object). Each 
subsequent step of the algorithm involves the selection of a fictional player j, determination whether the 
set S is nonempty, and if so, computation of the node i, and then updating the placement. The only parts 
that need explanation are the computation of S and i - S is simply the set of all nodes k that are not in 
equilibrium when fictional player j is deleted. We compute S as follows: for each node k in V, if replacing 
the current object in their cache by a yields a more preferable placement (according to the utility) then add k 
to S. Thus, S can be computed in time polynomial in n. The node i is simply a node in S such that lca(i, j) 
is lowest among all nodes in S, and can be computed in time polynomial in n. This completes the proof of 
the theorem. D 

4 A general axiomatic framework for CSR games 

We now present a new axiomatic framework which generalizes the result of Section [3] to a broad class of 
utility functions, and also enables us to study the existence and complexity of equilibria in more general 
settings. 

Node preference relations. We assume that each node iinV has a total preorder >j among all the nodes 
in V|^ > j further satisfies i >i j for all i, j G V. We say that a node i prefers j over k if j > j k, and call a 
node j most i-preferred in a set S of nodes if j is in S and j > j k for all k in S. We also use the notation 
j =i k whenever j >j k and k >j j, and j >j k whenever it is not the case that k >j j. Note that >j is 

''a total preorder is a binary relation that satisfies reflexivity, transitivity, and totality. Totality means that for any i, j, k, either 
j >i kovk>i j. 



a strict weak ordeiQ and for any i, j, and k, we have exactly one of these three relations holding: j >j k, 
k >i j, k =i j. We also extend the notation ai{P, a) and cji{P-i,a) denote a most i-preferred node holding 
a in P and P-i, respectively, breaking ties arbitrarily. 

The access cost function d introduced in Section |2] induces a natural node preference relation: j >, k 
if dij < dik, and j =j k if dij = dik- In fact, as we show in Lemma |4] undirected networks (i.e., when 
the access cost function is symmetric) are equivalent to acyclic node preference collections. Formally, the 
collection {>j: i G V} is an acyclic node preference collection if there does not exist a sequence of nodes 
io, ii, . . . , ik-i for an integer A: > 3 such that i(j_i) ,^^^ ^ >ij ^ij+i) mod k for all < j < fc. 

Lemma 4. Any undirected network yields an acyclic node preference collection. For any acyclic node 
preference collection, we can compute, in polynomial time, symmetric cost functions that are consistent with 
the node preferences. 

Proof. Let d denote a symmetric access cost function over the set V of nodes. For a given node i ^ V, 
we have j >j k iff dij < d^^. We now argue that the collection {>j: i G V} is acyclic. Suppose, for the 
sake of contradiction, that there exists a sequence of nodes ig, ii, . . . , ik-i for an integer A; > 3 such that 
^0-1) mod k >ij i{j+i) mod k for all < j < fc. It then follows that: 

"jji(j_l) mod k < "«j«U + l) mod ft for < J < K. 

Since d is symmetric, we obtain 

'^«J«(j-l) mod k < *^«(j + l) mod kij fOJ" < J < ^) 

which is a contradiction. 

Given an acyclic collection of node preferences, we compute an associated access cost function d in 
polynomial time as follows. We construct a directed graph G over the set U of all unordered pairs {i,j) : 
i,j € V,i y^ j. There is a directed edge from node {i,j) to {i, k) if and only if k > j j. Since the collection 
{>«: i E V} is acyclic, G is a dag. We compute the topological ordering vr :[/—)• Z; thus, we have 
^{{'''^j)) < '^{{^j^)) whenever there is a directed path from {i,j) to {k,i). Setting dij to be 7r((i, j)) gives 
us the desired undirected network. D 

Utility preference relations. In our game-theoretic model, each node attaches a utility to each global 
placement. We present a general definition that allows us to consider a large class of utility functions 
simultaneously. Rather than define a numerical utility function, we present the utility at each node i as a 
total preorder ^j among the set of all global placements. (The notation >~i and =j over global placements 
are defined analogously.) We require that ^j, for each i ^ V, satisfies the following two basic conditions. 

• Monotonicity: For any two global placements P and Q, if, for each object a and each node q with 
a G Qg, there exists a node p with a ^ Pp and p > , q, then P ^j Q. 

• Consistency: Let {Pi,P^i) and {Qi,Q-i) denote two global placements such that for each object 

a € Pi U Qi, if p (resp., q) is a most i-preferred node in F \ {i} holding a, i.e., a £ Pp (resp., 
a e Qq), then p =i q. If (Pi, P_i) ^i {Qi, P-i), then (P, Q_i) hi {Qi, Q-i). 

In words, the monotonicity condition says that for any node, if all the objects in a placement are placed at 
nodes that are at least as preferred as in another placement, then the node prefers the former placement at 
least as much as the latter. The consistency condition says that the preference for a node to store one set of 

''a strict weak order is a strict partial order > (a transitive relation that is irreflexive) in which the relation "neither a > b nor 
b > a" is transitive. Strict weak orders and total preorders are widely used in microeconomics. 



objects instead of another is entirely a function of the set of most preferred other nodes that together hold 
these objects. For instance, if a node i with unit capacity prefers to store a over /3 in a scenario where the 
most i-preferred node (other than i) storing a (resp., /3) is j (resp., k), then i prefers to store a at least as 
much as /3 in any other situation where the most i-preferred node (other than i) storing a (resp., /?) is j 
(resp., k). 

Generality of the conditions. We note that many standard utility functions defined for replica placement 
problems ||6l|20j|^, including the sum and max functions, satisfy the monotonicity and consistency con- 
ditions. Indeed, any utility function that is an Lp norm, for any p, over the costs for the individual objects, 
also satisfies the conditions. Furthermore, since the monotonicity and consistency conditions apply to the 
individual utility functions, our model allows the different nodes to adopt different types of utilities, as long 
as each separately satisfies the two conditions. 

Binary object preferences. One class of utility preference relations that we highlight is the ones based on 
binary object preferences. Suppose that each node i has a set Si of objects in which it is equally interested, 
and it has no interest in the other objects. Let Ti{P) denote the [SjI -length sequence consisting of the 
(jj(P, a), for a G Si, in nonincreasing order according to the relation >,. Then, the consistency condition 
can be further strengthened to the following. 

• Binary Consistency: For any placements P = {Pi, P^i) and Q = {Qi, Q-i) with P_j = Q-i, we 
have P ^i Q if and only if for 1 < fc < jS'il, the kth component of tj(P) is at least as z-preferred as 
the kth component of rj(Q). 

CSR Games. In the general framework, a CSR game is a tuple {V, O, {>i}, {^i})- ^ (pure) Nash equilib- 
rium for an CSR game instance is a global placement P such that for each i ^ V there is no placement Qi 
such that {Qi, P_i) ^i {Pi, P_i). 

For our complexity results, we need to give the specification for a given game instance. The set V 
is specified, together with node cache capacities, and O is an enumerated list of object names. The node 
preference relation >j is specified succinctly by a set of at most (2) bits, for each i. The utility preference 
relation ^j, however, is over a potentially exponential number of placements (in terms of n, m, and cache 
sizes). For our complexity results, we assume that the utility preference relations are specified by an effi- 
cient algorithm - which we call the utility preference oracle - that takes as input a node i, and two global 
placements P and Q, and returns whether P ^j Q. For the sum, max, and Lp-norm utilities, the utility 
preference oracle simply computes the relevant utility function. For binary object preferences, the binary 
consistency condition yields an oracle that is polynomial in number of nodes, objects, and cache sizes. 

Unit cache capacity. We now argue that the unit cache capacity assumption of Section |2] continues to hold 
without loss of generality. Consider a set V of nodes in which the cache of node i can store Cj objects. Let 
V' denote a new set of nodes which contains, for each node i in V , new nodes ii, ^21 • • • 1 ic,> i-e-> one new 
node for each unit of the cache capacity of i. We set the node preferences as follows: for all i, i',j G V, 
1 < /, ^ < Cj. 1 < k, k' < a, we have ik >ji i'l^i whenever i >j i', and jf =i^ ji. 

We consider an obvious onto mapping / from placements in V to those in V. Given placement P' 
for V', we set f{P') = P where Pi = Ui<jt<c,-P/,. This mapping naturally defines the utility preference 
relations for the node set V'. In particular, for any i ^ V and I < k < Ci, P' ^j^. Q' whenever f{P') hi 
f{Q'). We also note that / is computable in time polynomial in the number of nodes and the sum of the 
cache capacities. It is easy to verify that the utility preference relation hi^ for all ik G V satisfies the 
monotonicity and consistency conditions. Furthermore, P' is an equilibrium for V if and only if f{P') is 
an equilibrium for V; this together with the onto property of the mapping / gives us the desired reduction. 



5 Existence of equilibria in the general framework 

In this section, we establish the existence of equilibria for several CSR games under the axiomatic frame- 
work of Section [4] We first extend our result for the sum utility function on hierarchical networks to the 



general framework (Section 5.1 1. We next show that CSR games for undirected networks and binary object 



preferences are potential games (Section 5.2 1. Finally, when there are only two objects in the system, we 
use the technique of fictional players to give a polynomial-time construction of equihbria for CSR games 
for undirected networks (Section [53] ). 

5.1 Hierarchical networks 

We now show that the polynomial time algorithm of Section|3]extends to the axiomatic framework we have 
introduced. In the general framework, a hierarchical network can be represented as a tree T whose set of 
leaves is the node set V and the node preference relation >j given by: j >j k if lca(i, j) is a descendant 
of lca(i, k). Our algorithm of Section [3] and its analysis are completely determined by the structure of the 
hierarchical network and the pair-preference relations Zl« defined for each node i; the latter were defined 
for the sum utility function. In order to extend our analysis to the general framework, it suffices to derive a 
new preference relation and establish the analogue of Lemma [T| which we now present for arbitrary utility 
preference relations satisfying the monotonicity and consistency properties. 

Pair preference relations. Given any utihty preference relation ^j that satisfies the monotonicity and 
consistency conditions, we define a strict weak order Dj on O x Aj, where Ai is the set of proper ancestors 
of i in T. 

1. For each object a, node i, and proper ancestors v and w of i, we have (a, v) Dj (a, w) whenever v is 
a proper ancestor of w. 

2. Consider distinct objects a, (3 and nodes i,j,k with j, k ^ i. Let V denote the set of global placements 
P such that j (resp., k) is a most i-preferred node in y \ {%} holding a (resp., /3) in P_j. If there exist 
global placements P = ({a},P_i) and Q = ({/?}, P_i) in V with P >~i Q, then (a,lca(i, j)) Dj 
W,lca{i,k)). 

In words, item [T] says that z's preference for keeping a in its cache increases as the most i-preferred 
node holding a becomes less preferred (or "moves farther away"). In item[2j {a, v) Dj (/3, w) means that 
if i needs to place either a or /? in its cache, and the least common ancestor of i and the most z-preferred 
node in F \ {i} holding a (resp., /3) is v (resp., w), then i prefers to store a over /?. The strict weak order 
Dj induces a total preorder Zlj as follows: (a, v) ^j (/3, w) if it is not the case that (/3, v) Zli (a, w). We 
similarly define =j: (a, v) =i (/3, w) if (q, v) 3j (/3, w) and (/3, v) =i (a, w). 

Lemma 5. For each i, Z2i as given above, is a well-defined strict weak order 

Proof. We need to ensure the well-definedness of part [2] of the definition of pair preference relations. That 
is, we need to show that for any placements P_j and Q^i such that a most i-preferred node in P_i hold- 
ing a (resp., /?) is also a most i-preferred node in Q-i, it is impossible that {{a},P-i) >~i {{(3},P^i) 
and ({/?}, Q-i) >~i ({a}, Q-i) both hold. This directly follows from the consistency condition for utility 
preference relations. 

The reflexivity and transitivity of Zlj are immediate from the definitions and the reflexivity and tran- 
sitivity of ^j. Finally, to ensure the well-definedness of the strict preorder Dj, we also have to show 
that there is no collection of pairs {aj,Vj), < j < £ for some integer i > 1, such that {aj,Vj) Dj 
(oj+i mod e, Vj+i mod i) for < j < £. To see this, it is sufficient to note that if (a, v) Zlj (a', v') then for 
all placements P and P' such that P_j = P'i- and the least common ancestor of i and the most i-preferred 



node in F \ {i} that holds a (resp., a') is v (resp., v') we have P >~i P'. So any cycle in the strict preorder 
Dj implies a cycle in >~i, yielding a contradiction. D 

Lemma 6. For any global placement P = ({q},P_j), if j (resp., k) is a most i-preferred node holding 
object a (resp., /3) in P^i, and ({/?}, P~i) >-i ({a}, -P^j), i.e., for node i, storing (3 is a better response to 
P-i than storing a, then (/3, lca{i, k)) Zli (a, lca{i, j)). Futhermore, for any global placement P, {a} is a 
best response to P-i, where a maximizes (7, lca{i, ai{P-i,^))), over all objects 7, according to Zlj. 

Proof. The first statement of the lemma directly follows from item 2 of the definition of pair preference 
relations. We establish the second statement by contradiction. Suppose that for node i, {/?} is a better 
response to P_j than {a]. Then, we have ({/3}, P-i) >-i {{a}, P-i), which, by item 2 of the definition of 
pair preference relations, implies that (/3,lca(i, crj(P_j,/3))) Zlj {a,\ca.{i, ai{P-i, a))), a contradiction to 
the choice of a. D 

The remainder of the analysis for hierarchical networks (Lemma |2] and Theorem [3]) follows as before, 
invoking Lemma [6] instead of Lemma [T] 

5.2 Undirected networks with binary object preferences 

Let d be a symmetric cost function for an undirected network over the node set V. Recall that for binary 
object preferences, we are given, for each node i a set Si of objects in which i is equally interested. Our 
proof of existence of equilibria is via a potential function argument. Given a placement P, let <l>j [P) = dij, 
where j is the most i-preferred node in y — {i} holding the object in P,;. We introduce the potential function 
$: ^{P) = {^0, <I>ii (P), ^i^ (P), . . . , ^j„ (P)), where $0 is the number of nodes i such that Pi C Si, and 
^i (P) < ^i +1 (P)^ Vj, where V = {zi, 12, . . . , in}- We prove that <1> is an increasing potential function: 
after any better response step, ^ increases in lexicographical order. 

Let P = (Pi,P^i) be an arbitrary global placement. Assume that Pj = {a} and j is the most i- 
preferred node in P_j holding a. Consider any better response step, from placement P to Q = [Qi, P-i), 
where Qi = {/3}. Clearly /3 £ Si. We consider two cases. First, suppose a ^ Si and /3 G Si. Then, <l>o 
increases, and so does the potential. The second case is where a,P £ Si. Let k be the most z-preferred 
node in P_j holding /3. In this case, $0 does not change. However, since this is a better response step of 
i, j >i k, implying that dik > dij and hence ^i{Q) > ^i{P). Consider any other node j. If j holds any 
object 7 other than /?, since no new copy of 7 has been added, ^j{Q) > ^j{P). It remains to consider the 
case where j holds j3. If S is the set of nodes in y \ {j} holding /3 in P_j, then S U {i} is the set of nodes 
in V" \ {j} holding /3. Thus, <^jiQ) = mm{<^ j{P),dji} > min{$j(P), $i(Q)}. This also means that 
$j(P) appears later in the sorted order than ^i{P) and ^j{Q) appears no earlier in the sorted order than 
^i{Q). Hence, ^{Q) is lexicographically greater than <&(P). This establishes that for undirected networks 
with binary object preferences, the resulting CSR game is a potential game, and hence also in PLS Iil3il . 

5.3 Undirected networks with two objects 

We give a polynomial-time algorithm for computing an equilibrium in any undirected network with two 



objects. Our algorithm uses the fictional player technique introduced in Section 5.1 It starts by introducing 
fictional players serving both the objects in the network at zero cost from each node. In each subsequent 
step, we move the fictional players progressively "further" away, ensuring that each instant, we have an 
equilibrium. Finally, when the fictional players are at least preferred cost from all the nodes, they can be 
removed yielding an equilibrium for the original network. 

Suppose we are given a undirected network with access cost function d. Also let V be the set {0, ^1 , £2 , • • • 
of all access costs between nodes in the system in increasing order; that is, £1 = mirij j dij and £r = 
maxjj- dij and ti < ii+i for all 1 < i < r. 
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Fictional player. For an object a, a fictional a-player is a new node tliat will store a in every equilibrium; 
an fictional a-player prefers storing a over any other object. We denote by srva(i) the fictional a-player 
which is at access cost £ from every node in V. 

The algorithm. 

Initialization. Assuming that there are two objects a and /3 in the system, we initially set up a fictional 
a-player srva{0) and ;9-player srf^(O) at access cost from each node in V. We let nodes replicate their 
most preferred object and access the other without any access cost from the corresponding fictional player. 
This placement is obviously an equilibrium. 

Step t of algorithm. Fix an equilibrium P for the node set V U {srva{it)} U {srVjs{it)}. We describe one 
step of the algorithm which computes a new set of fictional players srva{it+i) and srvplit+i) and a new 
placement P' such that P' is an equilibrium for the node set V U {srva{it+i)} U {srvi^{it+i)}. We first 
remove the a-player srva{it) from the system and instead we add srva{(^t+i)- If there do not exist nodes 
that want to deviate we are done. Otherwise, assume that there exists a node i that wants to deviate from 
its strategy. Since the most i-preferred node holding /3 in y U {srvai^t)} U {srvi3{it)} remains the same 
in y U srva{it+i) U srvp{lt), i is not holding object a. Thus the only nodes that may want to deviate are 
those that are holding object /?. We argue that if we let i to deviate from /? G Pj to a S P/, there is no node 
j £ V \ {i} that gets affected by i's deviation. Consider the following two cases: 

• If a node j has access cost at most £t from i, then /3 £ Pj. Otherwise, if a G Pj, srva{£t) would not 
be the most i-preferred node holding a and thus i would not be affected by any change of a-players. 
Thus there does not exist any node j e V \ {i} with access cost at most it from i, such that a G Pj, 
and as we showed above a ^ P',. 

• If a node j has access cost at least £t+i from i, then Pj = P'-. Because of the a-player srva{£t+i) 
and the ;5-player srvp{£t), i would never be the j'-most preferred node in P' . 

We then remove the /3-player srvj3{£t) from the system and instead we add srv i3{£tJ^i) . Using a similar 
argument as above, we obtain a new equilibrium at the end of this step. 

Theorem 7. For undirected networks with two objects, an equilibrium can be found in polynomial time. 

Proof. An initial placement P, where we have the set of fictional players srva (0) and srvp (0) in the system, 
is obviously an equilibrium. It is immediate from our argument above that at termination the algorithm 
returns a valid equilibrium. 

The size of the set V is at most (2) which is at most n?. In each step t at most n nodes may want to 
deviate from their strategy, since we showed above that if a node deviates once in a step, it will not deviate 
again during the same step . Thus, the total number of deviations in the algorithm is at most n^. D 

6 Non-Existence of equilibria in CSR games and the associated decision 
problem 

In this section, we show that the classes of games studied in Section|5]are essentially the only games where 
equilibria are guaranteed to exist. We identify the most basic CSR games where equilibria may not exist, 
and study the complexity of the associated decision problem. 

6.1 NP- Completeness 

The main result of this section is the following theorem, which establishes the NP-hardness of determining 
whether a given CSR game has an equilibrium even when the utility preference relations are based on the 
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sum utility function and either the number of objects is small or the object preferences are binary. The proof 
is by a polynomial-time reduction from 3SAT [TT|. Each reduction is built on top of a gadget which has an 
equilibrium if and only if a specified node holds a certain object. Several copies of these gadgets are then 
put together to capture the given 3SAT formula. 

Theorem 8. The problem of determining whether a given CSR instance has an equilibrium is in NP. It 
is NP-hard to determine whether an CSR intance has an equilibrium even if one of these three restrictions 
hold: (a) number of objects is two; (b) object preferences are binary and number of objects is three; (c) 
network is undirected and number of objects is three. 

The membership in NP is immediate, since one can determine in polynomial time whether a given global 
placement is an equilibrium. The remainder of the proof focuses on the hardness reduction from 3SAT. 

Given a 3SAT formula with n variables xi, X2, • • • , x„ and k clauses ci, C2, . . . , c^, we construct an 
CSR instance as follows. For each variable Xj in (j), we introduce two variable nodes: node Xi and Xj. 
We set dxx- and the symmetric d^ x ^^ ^^ 0-^' where d is the underlying access cost function. For each 
clause Cj, we introduce a clause node Cj. Assuming that £j ,,. for r G {1,2,3} are the three literals of clause 
Cj in formula (j), we set dc l r ^^'^ (^L rC to be 1 for r G {1, 2, 3}. Note that each Lj^g, for j G [1, fc], and 
s G {1, 2, 3} is in fact some variable node Xh or Xh for some h G [1, n]. We also introduce a gadget G 
illustrated in Figure [3} consisting of nodes S, A, B, and C. We set the access cost dsd and the symmetric 
^(7-5, for all 1 < i < /c between node S and all clause nodes to be 2. The general construction is illustrated 
in Figure [2] 





Figure 2: Instance of the construction for the undirected 

case proof of NP-Hardness, where (f) = {xi\J x^\J x^) A Figure 3: Gadget for the directed (left) and the undirected 

... A {Xn-3 V a;„_i V af„) (right) case 

Directed networks with two objects. We set dAs^ Aab, dsc^ dcA to be 1. The server node at access cost 
dsrv = 10 from all nodes in V, stores a fixed copy of two objects a and /?. We set the weights r^. (a), 
rx,{ct), fxiiP), TxiiP) of each variable node on objects a and /3, to be 1. For each clause node we set the 
weight rci (a) on object a to be 1, and the weight rd (/3) on object /3 to be 0.7, for all 1 < i < A;. We set 
the weight rs{a) on object a to be 1, and the weight rs{f3) on object /3 to be 0.7. We set the weight rA{a) 
of node A on object a to be 0.7 and the weight rA{P) of node A on object /3 to be 1. Finally, we set the 
weight of nodes B and C on both objects a, and /? to be 1. We refer to this CSR instance as Ii. 
Undirected networks with three objects. For the undirected case, we set dAs and dss to be 3, dAB to 
be 3.1, dsc to be 3.05, and dcA to be 2; while symmetry holds. The server node, which is at access cost 
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dsrv = 5 from all nodes in V, stores a fixed copy of three objects a, (3, and 7. We set the weights r^;. (a), 
Tx^ (a), rx^ (/3), r^, (/3) of each variable node on objects a and /?, to be 1. Also, for each clause node we set 
the weight re. (a) on object a to be 0.85, and the weight rc^iP) on object /3 to be 1, for all 1 < i < A;. 
We set the weight rs{a) on object a to be 0.85, and the weight rs{fi) on object /3 to be 1. Also we set the 
weight rA{a) = 1, rA{l) = 2, rsiP) = 1, rB{^) = 0.9837, rc(/3) = 1, and rc(7) = 1.6. We set all the 
remaining weights to be 0. We refer to this CSR instance as I2. 

Lemma 9. A variable node Xi holds object a (resp., f3) if and only if node Xi holds object /3 (resp., a). 

Proof. The proof is immediate, since Xi (resp., Xi) is Xj's (resp., Xj's) nearest node, and both Xi and Xi 
are interested equally in a and /3. D 

Lemma 10. Clause node Ci holds object a if and only if its variable nodes Lij, for j G {1, 2, 3} hold 
object f3. 

Proof First, assume that Lij, for j G {1,2,3} hold /3. These nodes are Cj's nearest nodes holding /3. 
By LemmaMwe know that nodes Lj .,, for j G {1, 2, 3} hold a, and they are Cj's nearest nodes holding a. 
Node's d cost for holding a and accessing /3 from Ljj, for j £ {1, 2, 3}, is r^ {P)dciLi =1-1 = 1; while 
the cost for holding /3 and accessing a from Ljj, for j G {1, 2, 3}, is rs{a)d(j_j;;^^ = 0.85 • 1.5 = 1.275. 
Obviously, node Ci prefers to replicate a. 

Now assume that at least one of the nodes Lj j, for j G {1, 2, 3} holds a. These nodes are Cj's nearest 
nodes holding a. Also, by Lemma |9j Cj's nearest nodes holding /3 are all the remaining nodes from the set 
Lij, Lij, for j G {1, 2, 3}, that don't hold a. Node's Ci cost for holding (3 and accessing a from Lj j, for 
j G {1, 2, 3}, is rc^{a)dciLi = 0.85 • 1 = 0.85; while the cost for holding a and accessing /3 from node 
Lij (resp., Lij), is rc^{l3)d(j^L~^j = 1 • 1.5 = 1.5 (resp., rc^{/3)dc,Lij = 1-1 = 1). Obviously, in any case 
node Ci prefers to replicate /3. D 

Lemma 11. Node S holds object a if and only if all clause nodes Ci, . . . ,Ck hold object f3. 

Proof First, assume that Ci, . . . , Cfc are holding /3. These nodes are 5's nearest nodes holding /3. Also by 



Lemma 10 5's nearest node holding a is at least one of Lij nodes, where i G [1, k],j G {1, 2, 3}. The 
cost for S holding a and accessing /3 from a node Ci,i G [1, k], is rs{fi)dsCi = 1-2 = 2; while the cost 
for holding /3 and accessing a from Lij, where i G [1, A:], j G {1, 2, 3}, is rs{a)dsLi j = 0.85 - 3 = 2.55. 
Obviously, node 5 prefers to replicate a. 

Now assume that at least one of Ci, . . . , C^ holds a. These nodes are S"s nearest node holding a. Also 



S"s nearest node holding (3, due to Lemma 10 is one of Lij, where i G [1, k],j G {1, 2, 3}. The cost for 
holding P and accessing a from a node d, is rs{a)dsCi = 0.85 - 2 = 1.7; while the cost for holding a and 
accessing /3 from a node Lij, where j G {1, 2, 3}, is rs{(3)dsLi j = 1 " 3 = 3. Obviously, in any case node 
S prefers to replicate /3. D 

Theorem 12. The CSR instance Ii has an equilibrium if and only if node S holds object a. 



Proof. First, assume that S is holding a. By Lemma 1 1 nodes Ci, . . . ,Ck hold object /3, and by Lemma 10 



at least one of nodes Lij, for j G {1,2,3} for each node Ci,i G [1, A;], holds object a, and the corresponding 
Lij is holding object /3. We claim that the placement where A holds /3, B holds (3, and C holds a, is a pure 
Nash equilibrium. We prove this by showing that none of these nodes wants to deviate from their strategy. 

Node A does not want to deviate since its cost for holding object (3 and accessing a from A's nearest 
node S, is rA{a)dAs = 0.7 - 1 = 0.7; while the cost for holding object a and accessing /? from A's nearest 
node B, is rA{a)dAB = 1-1 = 1. Node B does not want to deviate since its cost for holding object (3 and 
accessing a from B's nearest node C, is rB{a)dBc = 1-1 = 1; while the cost for holding object a and 
accessing (3 from B's nearest node A, is rB{(3)dBA = 1-2 = 2. Node C does not want to deviate since 
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its cost for holding object a and accessing /3 from C's nearest node A, is rc{fi)dcA = 1-1 = 1; while the 
cost for holding object j3 and accessing a from C's nearest node S, is rc{a)dcs = 1-2 = 2. Also note that 
none of 5, Ci, . . . , Cfc, Lij, Lij f or i G [1, A;], j G {1, 2, 3} is getting affected of the objects been holded by 
the gadget nodes. 

Now assume that node S holds object /3. We are going to prove that for every possible placement over 
nodes A, B, and C, at least one node wants to deviate from its strategy. Consider the following cases: 

• Nodes A, B, and C hold object a: Node B (resp., C) wants to deviate, since the cost for holding 
object a and accessing /3 from B\ (resp., C's) nearest node S, is rB{(3)dBS = 1-3 = 3 (resp., 
rc{/3)dcs = 1-2 = 2); while the cost for holding object /3 and accessing a from B's nearest node 
A, is rB{^)dBA = 1-2 = 2 (resp., rc{^)dcA = 1-1 = 1). 

• Two nodes hold object a and the third holds /3: In the case where A and B hold a, A wants to deviate 
since the cost while holding a and accessing /3 from A's nearest node S is rA{P)dAs = 1-1 = 1; 
while the cost for holding /3 and accessing a from yl's nearest node B is rA{a)dAB = 0.7-1 = 0.7. In 
the case where A and C hold a, then C wants to deviate since the cost while holding a and accessing /3 
from C's nearest node B is rc{ji)dcB = 1-2 = 2; while the cost for holding /3 and accessing a from 
C's nearest node A is rc{a)dcA = 1 - 1 = 1. In the case where B and C hold a, B wants to deviate 
since the cost while holding a and accessing /3 from i?'s nearest node A is rB{/3)dBA = 1-2 = 2; 
while the cost for holding /? and accessing a from B's nearest node C is rB{a)dBC = 1-1 = 1- 

• One node holds a: If A (resp., B, or C) holds a, B (resp., C, A) wants to deviate since the cost 
while holding /? and accessing a from B's (resp., C's, or A's) nearest node A (resp., B, or C) is 
rB{a)dBA = 1-2 = 2 (resp., rc{a)dcB = 1 - 2 = 2, or rA{a)dAc = 0.7 - 2 = 1.4); while the 
cost for holding a and accessing /? from S's (resp., C's, or A's) nearest node C (resp.. A, or 5), is 
rB{P)dBC = 1-1 = 1 (resp., rc{f3)dcA = 1 - 1 = 1, or rA{P)dAB = 1-1 = 1). 

• Nodes A, B, and C hold /3: All of them want to deviate. Node A wants to deviate since the cost while 
holding /3 and accessing a from yl's nearest node Cj, for some i G [1, A;], is rA{P)dACi = 1-3 = 3; 
while the cost for holding /3 and accessing a from A's nearest node S is rA{a)dAs = 0.7 - 1 = 0.7. 
Similar proof holds for nodes B and C. 

Obviously the system does not have a pure Nash equilibrium, which completes the proof. D 

Theorem 13. The CSR instance I2 has an equilibrium if and only if node S holds object a. 



Proof. First, assume that S is holding a. By Lemma 1 1 nodes Ci , . . . , C^ hold object /3, and by Lemma 10 



at least one of nodes Lij, for j G {1,2,3} for each node Ci,i G [1, A;], holds object a, and the corresponding 
Ljj is holding object /3. We claim that the placement where A holds 7, node B holds /3, and C holds 7 
is a pure Nash equilibrium. We prove this by showing that none of these nodes wants to deviate from 
their strategy. Node A doesn't want to deviate since the cost for holding object 7 and accessing object a 
from node S is rA{(y)dAS = 3; while the cost for holding a and accessing 7 from node C increases to 
'r'A{l)dAC = 4. Node B doesn't want to deviate since the cost for holding object /3 and accessing object 7 
from node C is rB{'y)dBC = 0.9837 - 3.05 = 3.000285; while the cost for holding object /3 and accessing 7 
from the server increases to tb {'~l)dsrv = 5. Node C doesn't want to deviate since the cost for holding object 
7 and accessing j3 from node B is rc{P)dcB = 3.05; while the cost for holding object /3 and accessing 7 
from node A increases to rc{/3)dcA = 3.2. 

Now assume that node S holds object /3. We are going to prove that for every possible placement over 
nodes A, B, and C, at least one node wants to deviate from its strategy. Consider the following cases: 
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• Node A holds a, node B holds 7, and node C holds /3: Node A wants to deviate since the cost while 
it is holding object a and accessing object 7 from node B is {rA{'y)dAB = 6.2); while the cost for 
holding object 7 and accessing a from the server decreases to rA{a)dsrv = 5. 

• Node A holds 7, node S holds 7, and node C holds /3: Node S wants to deviate since the cost while 
it is holding object 7 and accessing object /3 from node C is {rB{l3)dBC = 3.05); while the cost for 
holding object /3 and accessing 7 from node A decreases to rB{'y)dBA = 3.04947. 

• Node A holds 7, node B holds /?, and node C holds /?: Node C wants to deviate since the cost while 
it is holding object /3 and accessing object 7 from node A is {rc{'y)dcA = 3.2); while the cost for 
holding object 7 and accessing /3 from node B decreases to rc{(3)dcB = 3.05. 

• Node A holds 7, node B holds /3, and node C holds 7: Node A wants to deviate since the cost while 
it is holding object 7 and accessing object a from the server is {rA{c()dsrv = 5); while the cost for 
holding object a and accessing 7 from node C decreases to rA{'j)dAc = 4. 

• Node A holds a, node i? holds /3, and node C holds 7: Node S wants to deviate since the cost while 
it is holding object /3 and accessing object 7 from node C is {rB{l)dBC = 3.000285); while the cost 
for holding object 7 and accessing /3 from node S decreases to rB{P)dBs = 3. 

• Node A holds a, node B holds 7, and node C holds 7: Node C wants to deviate since the cost while 
it is holding object 7 and accessing object /3 from the server is {rc{P)dsrv = 5); while the cost for 
holding object /3 and accessing 7 from B decreases to rc{'y)dBC = 4.88. 

• Node A holds a, node B holds /3, and node C holds /3: Node C wants to deviate since the cost while 
it is holding object /3 and accessing object 7 from the server is {rc{'y)dsrv = 4.9185); while the cost 
for holding object 7 and accessing /3 from node B decreases to rB{fi)dcB = 3.05. 

• Node A holds 7, node B holds 7, and node C holds 7: Node A wants to deviate since the cost while 
it is holding object 7 and accessing object a from the server is {rA{ci)dsTv = 5); while the cost for 
holding object a and accessing 7 from C decreases to r^ (7)^^(7 = 4. 

The remaining placements where A holds a, B holds a, and C holds a, obviously are not stable since none 
of the nodes are interested in these objects. Since there does not exist a stable placement, an equilibrium 
does not exist. D 

Binary object preferences over three objects.. For the binary object preferences, we introduce two extra 
nodes K and L. We set dc^K, for i G [1, k], between clause nodes and K to be 1.4, dsL to be 2.1, and d^s, 
dAB, dBC^ dcA to be 1. The server node, which is at access cost dgrv = 10 from all nodes in V , stores a 
fixed copy of three objects a, /3, and 7. Each node i has a set Si of objects in which it is equally interested. 
For nodes Xi, Xi, for i G [1, n], we set Sx^ = {a, /3} and Sx = {a, /5}. For nodes d, for i G [1, k], we 
set Sc^ = {a, 7}. For node K we set Sk = {7}; while for node L we set Sl = {/3}. For node S* we set 
Ss = {") /3}. For nodes A, B, and C we set 5^1, 5b, and Sc correspondingly to be the set {a, 7}. As we 
mentioned in the binary object preference definition for our utility function Us{i), equally interested means 
weight 1 for all objects in Si, and for the remaining. We refer to this instance as I^. 
Lemma|9]holds as it is for the binary object preferences directed case. 

Lemma 14. Clause node d holds object a if and only if its variable nodes Lij, for j € {1, 2, 3} hold 
object /3. 



15 



Proof. First, assume that Lj j, for j G {1, 2, 3} hold j3. By Lemma ^ we know that nodes Lj j, for j S 
{1, 2, 3} hold a, and they are Cj's nearest nodes holding a; while Cj's nearest node holding 7 is node K. 
Node's Ci cost for holding a and accessing 7 from i^ is ddK = 1-4; while the cost for holding 7 and 
accessing a from Ljj, for j G {1, 2, 3}, is d(jjj. =1.5. Obviously, node Ci prefers to replicate a. 

Now assume that at least one of the nodes Ljj, for j G {1,2,3} holds a. These nodes are Cj's nearest 
nodes holding q; while again Cj's nearest node holding 7 is node K. Node's Cj cost for holding 7 and 
accessing a from L^j, for j G {1, 2, 3}, is ddhi = 1; while the cost for holding a and accessing 7 from 
node K is dc^ii' = 1-4. Obviously, node Cj prefers to replicate 7. D 

Lemma 15. Node S holds object a if and only if all clause nodes Ci, . . . ,Ck hold object 7. 



Proof. First, assume that Ci, . . . , C^ are holding 7. By Lemma [14} 5"s nearest node holding a is at least 
one of Lj j nodes, where i G [1, k],j G {1, 2, 3}; while 5"s nearest nodes holding /3 is node L. The cost for 
S holding a and accessing /? from node L, is d^i = 2.1; while the cost for holding /3 and accessing q from 
Lj j, where i G [1, A;], j G {1, 2, 3}, is dsLi j = 3. Obviously, node S prefers to rephcate a. 

Now assume that at least one of Ci, . . . , C^ holds a. These nodes are 5's nearest node holding a; while 
again 5's nearest node holding /3 is L. The cost for holding /3 and accessing a from a node Ci, is d^c". = 2; 
while the cost for holding a and accessing /3 from a node L is (i5j^ = 2.1. Obviously, node S prefers to 
replicate /3. D 

Theorem 16. There exists an equilibrium for the CSR instance I3 if and only if node S holds object a. 



Proof. First, assume that S is holding a. By Lemma 15 nodes Ci, . . . , C^ hold object 7, and by Lemma 14 



at least one of nodes Lj j, for j G {1,2,3} for each node Ci,i G [1, A;], holds object a, and the corresponding 
Lij is holding object /3. We claim that the placement where A holds 7, B holds 7, and C holds a, is a pure 
Nash equilibrium. We prove this by showing that none of these nodes wants to deviate from their strategy. 

Node A does not want to deviate since its cost for holding object 7 and accessing a from A's nearest 
node S, is dAs = 1; while the cost for holding object a and accessing 7 from A's nearest node B, is still 
dAB = 1- Node B does not want to deviate since its cost for holding object 7 and accessing a from B's 
nearest node C, is dsc = 1; while the cost for holding object a and accessing 7 from B's nearest node A, 
is still dsA = 1- Node C does not want to deviate since its cost for holding object a and accessing 7 from 
C's nearest node A, is dcA = 1; while the cost for holding object 7 and accessing a from C's nearest node 
S, is still dcs = 1- Also note that none of 5, Ci, . . . , Ck,Lij, Lij for i G [1, k],j G {1, 2, 3} is getting 
affected of the objects been holded by the gadget nodes. 

Now assume that node S holds object /3. We are going to prove that for every possible placement over 
nodes A, B, and C, at least one node wants to deviate from its strategy. Consider the following cases: 

• Nodes A, B, and C hold object a: Node B (resp., C) wants to deviate, since the cost for holding 
object a and accessing 7 from i?'s (resp., C's) nearest node Cj, for some i G [1, k] or from node K, 
is dsCi = 5 or dsK = 6.4 (resp., dcc^ = 4 or dcK = 5.4); while the cost for holding object 7 and 
accessing a from B's nearest node A, is dBA = 2 (resp., dcA = !)■ 

• Two nodes hold object a and the third holds 7: In the case where A and B hold a, A wants to deviate 
since the cost while holding a and accessing 7 from A's nearest node C is dAc = 2; while the cost 
for holding 7 and accessing a from A's nearest node B is dAB = 1- The other cases are symmetric. 

• One node holds a: If A holds a, B wants to deviate since the cost while holding 7 and accessing a 
from B's nearest node A is dBA = 2; while the cost for holding a and accessing 7 from B's nearest 
node C, is dBc = 1- The other cases are symmetric. 
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• Nodes A, B, and C hold 7: All of them want to deviate. Node A wants to deviate since the cost while 
holding 7 and accessing a from A's nearest node d, for some i G [1, A;], is dAd = 3; while the cost 
for holding a and accessing 7 from As nearest node B is dAB = 1- The other cases are symmetric. 

Obviously the system does not have a pure Nash equilibrium, which completes the proof. D 

We now show that cj) is satisfiable if and only if the above CSR games (both undirected and directed 
cases) (resp., for the binary object preferences, directed case) has a pure Nash equilibrium. Suppose that 
is satisfiable and consider a satisfying assignment for 0. If the assignment of a variable xi is True, then we 
replicate object a in cache of variable node Xi ; otherwise, we replicate object /3. By Lemma [9] we know 
that a variable node X-i holds object a (resp., /3) if and only if node Xj holds object /3 (resp., a). In this 
way we keep the consistency between truth assignment of a variable and its negation. By Lemma [T0| (resp . , 



Lemma 14 1 we know that a clause node Cj, will replicate object /3 (resp., 7) if and only if at least one of its 
variable nodes, holds object a. From above, any clause node Cj will hold object /3 (resp., 7) only if at least 
one of clause Cj literals is True. By Lemma[TT](resp., Lemma[T5]), we know that node S, will replicate object 
a if and only if all clause nodes Ci , . . . , Cfc are holding object /3 (resp., 7). Thus, node S replicates object a 
only if all clauses ci, . . . , Cjt are True. By Theorems 12 and [73] (resp . , [76]) , we know that there exists a pure 



Nash Equilibrium if and only if object /3 is stored to node S\ thus, there exists a pure Nash Equilibrium if 
and only if all clauses are True. This gives our proof. 

6.2 Binary preferences over two objects 

Consider the problem 2BIN: does a given CSR instance with two objects and binary preferences possess 
an equilibrium? We prove that 2BIN is polynomial-time equivalent to the notorious EVEN-CYCLE prob- 
lem [34J: does a given digraph contain an even cycle? Despite intensive efforts, the complexity of the 
problem EVEN-CYCLE was open until Il23ll29l provided a tour de force polynomial-time algorithm. Our 
result thus also places 2BIN in P. 

Theorem 17. EVEN-CYCLE is polynomial-time equivalent to 2BIN. 

We prove the polynomial-time equivalence of 2BIN and EVEN-CYCLE by a series of reductions. 
We first show the equivalence between 2BIN and 2DIR-BIN, which is the sub-class of 2BIN instances 
in which the node preferences are specified by an unweighted directed graph (henceforth digraph); in a 
2DIR-BIN instance, we are given a digraph, and the preference of a node for the other nodes increases with 
decreasing distance in the graph. 

Lemma 18. 2BIN is polynomial-time equivalent to 2DIR-BIN. 

Proof. Given a 2BIN instance / with node set V , two objects, node preference relations {>j: i G V}, 
and interest sets {Si : i ^ V} , snq construct a 2DIR-BIN instance /' with the same node set, objects, and 
interest sets, but with the node preference relations specified by an unweighted digraph G. Our construction 
will ensure that any equilibrium in / is an equilibrium in /' and vice- versa. For distinct nodes i and j, we 
have an edge from i to j if and only if j is a most i-preferred node in 1/ \ {i}. We now argue that / has 
an equilibrium if and only if /' has an equilibrium. A placement for / is an equilibrium if and only if the 
following holds for each node i: (a) if \Si\ = 1, then i holds the lone object in Sf, (b) if \Si\ = 2, then the 
object not held by i is at an z-most preferred node. Similarly, any equilibrium placement for /' satisfies the 
following condition for each i: (a) if |5j| = 1, then i holds the lone object in Si; (b) if |Sj| = 2, then the 
object not held by i is at a neighbor of i. By our construction of the instances, equilibria of I are equilibria 
of /' and vice- versa. D 
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We next define EXACT-2DIR-BIN, which is the subclass of 2DIR-BIN games where each node is 
interested in both objects; thus, an EXACT-2DIR-BIN instance is completely specified by a digraph G. 
We say that a node i is stable in a given placement P if Pi is a best response to P_j. We say that an 
EXACT-2DIR-BIN instance G is stable (resp., 1-critical) if there exists a placement in which all nodes 
(resp., all nodes except at most one) are stable. Since each node has unit cache capacity, each placement is a 
2-coloring of the nodes: think of a node as colored by the object it holds in its cache. Given a placement, an 
arc is said to be bichromatic if its head and tail have different colors. Note that for any EXACT-2DIR-BIN 
instance, a node is stable in a placement iff it has a bichromatic outgoing arc. 

Lemma 19. 2DIR-BIN and EXACT-2DIR-BIN are polynomial-time equivalent on general digraphs. 

Proof. Since EXACT-2DIR-BIN games are a special subclass of 2DIR-BIN games, we only need to show 
that 2DIR-BIN games reduce to EXACT-2DIR-BIN games. Given an instance of a 2DIR-BIN game, 
we need to handle the nodes that are interested in at most one object. First, note that we can remove the 
outgoing arcs from all such nodes. Let Vq consist of the nodes with no objects of interest. For each node 
u in Vq we add a new node uq to Vq along with arcs {u,uq) and (uq, u). Let red and blue denote the two 
objects. Let Vr and VJ, denote the set of nodes interested in red and blue, respectively. Without loss of 
generality, let |T4-| > |H|- Add \Vr\ — \Vh\ additional nodes to the set Vh (so that \Vr\ = \Vb\) and connect 
all the nodes in Vr |J Vj, with a directed cycle that alternates strictly between Vr nodes and V^ nodes. The 
rest of the network is kept the same and all the nodes are set to have interest in both objects. Now, if the 
original instance is stable then we can stabilize the new instance by having each node in Vr (resp., VJ,) cache 
the red (resp., blue) object, the nodes in Vq cache any object (so long as an original node u and its associated 
node uo store complementary objects) and the other nodes cache the same object as in the placement that 
made the original instance stable. And in the other direction, if the transformed instance is stable then in an 
equilibrium placement, the nodes in Vr must each store an object of one color while each node in Vh stores 
the object of the other color. By renaming the colors, if necessary, we get a stable coloring (placement) for 
the original instance. D 

For completeness, we next present some standard graph-theoretic terminology that we will use in our 
proof. A digraph is said to be weakly connected if it is possible to get from a node to any other by following 
arcs without paying heed to the direction of the arcs. A digraph is said to be strongly connected if it is 
possible to get from a node to any other by a directed path. We will use the following well-known structure 
result about digraphs: a general digraph that is weakly connected is a directed acyclic graph on the unique 
set of maximal strongly connected (node-disjoint) components. We will also use the following strengthening 
of the folklore ear-decomposition of strongly connected digraphs |[30l : 

Lemma 20. An ear-decomposition can be obtained starting with any cycle of a strongly connected digraph. 

Proof. The proof is by contradiction. Suppose not, then consider a subgraph with a maximal ear-decomposition 
obtainable from the cycle in question. If it is not the entire digraph then consider any arc leaving the sub- 
graph. Note that the digraph is strongly connected and hence such an arc must exist. Further, note that every 
arc in a digraph is contained in a cycle since there is a directed path from the head of the arc to the tail. 
Starting from the arc follow this cycle until it intersects the subgraph again, as it must because it ends at the 
tail which lies in the subgraph. This forms an ear that contradicts the maximality of the decomposition. D 

Lemma 21. EVEN-CYCLE on strongly connected digraphs and EVEN-CYCLE on general digraphs 
are polynomial-time equivalent. 

Proof. Since strongly connected digraphs are a special subclass of general digraphs it suffices to show that 
EVEN-CYCLE on general digraphs can be reduced to EVEN-CYCLE on strongly connected digraphs. 
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Remember that a general digraph has a unique set of maximal strongly connected components that are dis- 
joint and computable in polynomial-time. Further any cycle, including even cycles, must lie entirely within 
a strongly connected component. Thus a digraph possesses an even cycle iff one of its strongly connected 
components does. Hence it follows that EVEN-CYCLE on general digraphs reduces to EVEN-CYCLE 
on strongly connected digraphs. D 

Lemma 22. EVEN-CYCLE and EXACT-2DIR-BIN games are polynomial-time equivalent on strongly 
connected digraphs. 

Proof. To show the polynomial-time equivalence, we show that a strongly connected digraph is stable iff 
it has an even cycle. One direction is easy. If the digraph is stable then consider the placement in which 
every node is stable. So every node has a bichromatic outgoing arc; by starting at any node and following 
outgoing bichromatic edges we will eventually loop back on ourselves. The loop so obtained is the required 
even cycle; it is even because it is composed of bichromatic arcs. In the other direction, if there is an 



even cycle then we take the ear-decomposition starting with that cycle (Lemma 20 1, stabilize that cycle 
(by making each arc bichromatic since it is of even cardinality) and then stabilize each node in each ear by 
working backwards along the ear. D 

Lemma 23. Any EXACT-2DIR-BIN game on a strongly connected digraph is 1-critical. 

Proof. Consider an ear-decomposition of the strongly connected digraph starting with a cycle. Observe that 
all but at most one node of the cycle can be stabilized by arbitrarily assigning one color to a node, and then 
assigning alternate colors to the nodes as we progress along the cycle. Every node in the cycle, other than 
possibly the initial node, is stable. The rest of the digraph can be stabilized ear by ear, stabilizing each 
ear by working backwards from the point of attachment. Hence, all but one node of the digraph can be 
stabilized. D 

Lemma 24. EXACT-2DIR-BIN on general digraphs is polynomial-time equivalent to EXACT-2DIR-BIN 

on strongly connected digraphs. 

Proof. Since strongly connected digraphs are a subclass of general digraphs we need only show that the 
problem EXACT-2DIR-BIN on general digraphs reduces to EXACT-2DIR-BIN on strongly connected 
digraphs. A general digraph is stable iff all of its weakly connected components are. A weakly connected 
component is a directed acyclic graph (dag) on the strongly connected components. It is clear that a weakly 
connected component cannot be stabilized if any one of the strongly connected components that is a minimal 
element of the directed acyclic graph cannot be stabilized. Interestingly, the converse is also true. If all of 
the strongly connected components that are minimal elements of the dag can be stabilized then the entire 
weakly connected component can be stabilized because each of the other strongly connected components 
has at least one outgoing arc which is used to stabilize its tail while the rest of the strongly connected 
component can be stabilized because strongly connected components are 1-critical by Lemma |23] We can 
determine such a stable placement by processing the strongly connected components in topologically sorted 
order (according to the dag) starting from the minimal elements. Thus a digraph is stable iff every strongly 
connected component that is a minimal element is stable. Hence, EXACT-2DIR-BIN on general digraphs 
is reducible in polynomial-time to strongly connected digraphs. D 

7 Fractional replication games 

We introduce a new class of capacitated replication games where nodes can store fractions of objects, as 
opposed to whole objects, and satisfy an object access request by retrieving enough fractions that make up 
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the whole object. Rather than associate different identities with different fractions of a given object, we view 
each portion of an object as being fungible, thus allowing any set of fractions of an object, adding up to at 
least one, to constitute the whole object. Such fractional replication scenarios naturally arise when objects 
are encoded and distributed within a network to permit both efficient and reliable access. 

Several implementations of fractional replication, in fact, already exist. For instance, fountain codes llH 
[BTI and the information dispersal algorithm (2S] present two ways of encoding an object as a number of 
smaller pieces - of size, say 1/m fraction of the full object size, where m is an integer - such that the full 
object may be reconstructed from any m of the pieces. A natural formalization is to view each object as a 
polynomial of high degree, and consider each piece of the object as the evaluation of the polynomial on a 
random point in a suitable large field. Then, accessing an object is equivalent (with very high probability) 
to accessing a sufficient number of pieces of the object. 

We now present fractional capacitated selfish replication (F-CSR) games, which are an adaptation of the 
game-theoretic framework developed in Section|4]to fractional replication. We have a set V of nodes sharing 
a set O of objects. In an F-CSR game, the strategies are. fractional placements; a fractional placement P is 
a |l/|-tuple {Pi : i £ V} where P^ : O — )■ 3f? under the constraint that sum of Pi{a), over all a in O, is at 
most the cache size of i. 

We begin by presenting F-CSR games in the special case of sum utilities, where the generalization from 
the integral to the fractional setting is most natural. For sum utilities, recall that we are given a cost function 
d and node-object weights rj(a), i £ V, a £ O. Given a fractional global placement P, we define the cost 
incurred by i for accessing object a as the minimum value of Xjdij under the constraints that ^ Xj = 1 

and Xj < Pj{a) for all j. Then, the total cost incurred by i is the sum, over all objects a, of ri{a) times the 
cost incurred by i for accessing a. For a given fractional global placement P, the utility of i is the negative 
of the total cost incurred by i under P. 

We now consider F-CSR games under the more general setting of utility preference relations. As 
before, each node i has a node preference relation >j and a preference relation ^j among global (integral) 
placements. Recall that the node and placement preference relations of each node i induce a preorder Zl j 
among the elements of O x (V \ {i}) (see Section [4]). For F-CSR games, we require the existence of a 
total preorder Zl j, for all i. We now specify the best response function for each player for a given fractional 
global placement P. For each node i and object a, we determine the assignment fJ-^p^ '■ V \ {i} — )• 
3? that is lexicographically minimal under the node preference relation >i subject to the condition that 
Aij P Q < -Pfc(a) for each k and J2k A^j p a^^^ ~ ^- ^^ '^^^^ compute 6^ p : O x (y \ {%)) — ^ 3f? to be 
the lexicographically maximal assignment under Zlj subject to the condition that 6^ p(a, /c) < /i^ p oS^) for 
all k and ^^ ^ 6^ p(a, ^) is at most the size of i's cache. The best response of a player i is then to store 
l^fc ^i p(o^; ^) of Q! in their cache. This completes the definition of F-CSR games. 

Using standard fixed-point machinery, we show that every F-CSR game has an equilibrium. We also 
show that finding equilibria in F-CSR games is PPAD-complete. 

Theorem 25. Every F-CSR instance has a pure Nash equilibrium. Finding an equilibrium in an F-CSR 
game is PPAD-complete. 

We prove Theorem|25]by establishing separately the existence of equilibria, membership in PPAD, and 
the PPAD-hardness of finding equilibria. 

7.1 Existence of equilibria 

Theorem 26. Every F-CSR instance has a pure Nash equilibrium. 

Proof. By f25l, a game has a pure Nash equilibrium if the strategy space of each player is a compact, 
non-empty, convex space, and the payoff function of each player is continuous on the strategy space of all 
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players and quasi-concave in the strategy space of the player. In an F-CSR instance, the strategy space of 
each player i is simply the set of all its fractional placements: that is, the set of functions / : O — ;■ [0, 1] 
subject to condition that ^^^.^o f(^) — ^i^ where q is the cache size of the node (player). The strategy set 
thus is clearly convex, non-empty, and compact. Furthermore, as defined above, the payoff for any player i 
under fractional placement P is simply the solution to the following linear program: 

max- ^ ri{a)C^Xij{a)dij) 

22i ^ij i'^) = 1 for alH G y, a G O 

Xij{a) < Pj{a) for all i, j G V, a G O 
Xij{a) > for all i,jeV,aeO 

It is easy to see that the payoff function is both continuous in the placements of all players, and quasi-concave 
in the strategy space of player i, thus completing the proof of the theorem. D 

7.2 Membership in PPAD 

Theorem 27. Finding an equilibrium in an F-CSR game is in PPAD. 

Proof. Our proof is by a reduction from FSPP (Fractional Stable Paths Problem), which is defined as fol- 
lows flS]. Let G be a graph with a distinguished destination node d. Each node v ^ d has a list ^{v) of 
simple paths from f to d and a preference relation >^ among the paths in t:{v). For a path S, we also define 
'k{v,S) to be the set of paths in ^{v) that have S" as a suffix. A proper suffix 5 of P is a suffix of P such 
that S ^ P and S ^ $. K feasible fractional paths solution is a set w = {w^ : v ^ d} of assignments 
w^ : 7r(t>) — )• [0, 1] satisfying: (1) Unity condition: for each node v, J^peiriv) '^v{P) < 1. and (2) Tree 
condition: for each node v, and each path S with start node u, J^Peniv s) "^viP) < Wu{S). 

In other words, a feasible solution is one in which each node chooses at most 1 unit of flow to d such 
that no suffix is filled by more than the amount of flow placed on that suffix by its starting node. A fea- 
sible solution w is stable if for any node v and path Q starting at v, one of the following holds: (SI) 
J2peTT(v) "^viP) = 1, and for each P in 7r(t') with Wv{P) > 0, P >v Q', or (S2) There exists a proper suffix 
SofQ such that Ylpeniv s) '^v{P) = Wu{S), where u is the start node of S, and for each P G 7r(u, S) with 
Wy{P) >0,P>vQ. 

Given an F-CSR G with node set V, object set O, node preference relations >j for i £ V, and utility 
preference relations ^j for i G F, we consti^uct an instance I of FSPP as follows. For nodes i,j £ V and 
object Q G O, we introduce the following FSPP vertices. 

• hold(i, a) representing the amount of a that node i will store in its cache. 

• serve(i, j, q) representing the amount of a that node j will serve for i given a placement for V \ {i}. 

• serve' (z, j, a), an auxiliary vertex needed for serve(i, j, a). 

• serve (z, a), representing the amount of a that other nodes will serve for i given a placement forV\{i}. 

• hold(i), representing the best response of i give the placement of other nodes. 

• hold' (i, a), an auxiliary vertex needed for hold(i, a). 

We now present the path sets and preferences for each vertex of the FSPP instance. 
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• serve(j,a): the path set includes all paths of the form (serve(i, a),hold(j, a), d), and serve(i,a) 
prefers (serve(i, a),hold(j, a), d) over (serve(i, a),hold(A;, a), d) if j >i k. 

• serve'(i, j, a): the path set includes all paths of the form (serve'(i, j, a), serve(i, a),hold(j, a),d) 
and the direct path (serve'(i, j, a), d). For the preference order, serve'(i, j, a) prefers all paths 
(serve' (i,j, a), serve(i, a),hold(j, a), d) equally, and all of them over the direct path. 

• serve(i, j, a): the path set includes the path (serve' (i, j, a), d) and the direct path (serve(i, j, a),d) 
with a higher preference for the former path. 

• hold(i): the path set includes paths of the form (hold(i), serve(j, j, a), d), and hold(i) prefers the path 

(hold(i), serve(i, j, a), d) over (hold(i), serve(z, k, I3),d) if {j, a) 3i {k, /3). 

• hold'(i,a): the path set includes paths of the form (hold'(i,a),hold(i),serve(i, j, a), d) all of which 
are preferred equally, and the direct path (hold' (i, a), d) which is preferred the least. 

• hold(i, a): the path set includes two paths (hold' {i, a), d) and the direct path with a higher preference 
for the former path. 

We now show that the F-CSR instance has an equilibrium if and only if the FSPP instance has an 
equilibrium. Our proof is by giving a mapping / from global fractional placements in the F-CSR instance 
to feasible solutions in the FSPP instance such that (a) if P is an equilibrium for the F-CSR instance, then 
f{P) is an equilibrium for the FSPP instance, and (b) if w is an equilibrium for the FSPP instance, then 
f~^{w) is an equilibrium for the F-CSR instance. 

Let P denote any fractional placement of the F-CSR instance. We now define the solution f{P) of the 
FSPP instance. In f{P) vertex hold(i, a) plays Pi{a) on the direct path and 1 — Pi{a) on the other path in 
its path set, for every iinV and a in O. The remaining vertices play their best responses, considered in the 
following order. First, consider vertices of the form serve(i, a). In the best response, the amount played by 
serve(i, a) on the path (serve(i, a), hold(j, a),d), equals fi. p ^(j); recall that fi. p ^(j) is is the assignment 
that is lexicographically minimal under the node preference relation >i subject to the condition that fi- p ^ < 

Pk{a) for each k and J2k A*j p a^^^ ~ ^- ^^ ^^^^ consider the vertices of the form serve' {i, j, a). In its best 
response, vertex serve' (i,j, a) plays fi. p ^{j) on the path (serve' (i,j, a), serve(i, a),hold(j, a), d). Next, 
in its best response, vertex serve(i, j, a) plays fi. p ^{j) on its direct path and I — fx-p ^(j) on its remaining 
path. We now consider the best response of vertex hold(i); it distributes its unit among paths of the form 
(hold(i),serve(j, j, q), d) (for all j in y \ {i} and a in O) lexicographically maximally under the total 
preorder Zlj over node-object pairs. That is, hold(i) plays b. p{a,j) on the path (hold(i), serve(i, j, a), d). 
We next consider the best response of the vertex hold' (i, a); it plays 1 — X^i ^j pi'^^j) on its direct path and 
&j p(a,i) on the path (hold'(i, a),hold(i), serve(i, j, a), d). This completes the definition of the solution 

fin 

We now argue that if P is an equilibrium so is f{P). By construction, every vertex other than of the 
form hold(i, a) play their best responses in f{P). We next show that i plays a best response in P if and 
only if the vertices hold(i, a) play their best response in f{P). The best response of hold(i, a) is to play 
1 — ^6^ p{a,j) on the path (hold(i, Q),hold'(i, a), d) and the ^ b.p{a,j) on its direct path. The best 

responseof iinPis toset Pj(a) to J2j ^j p{oid)- Thus if Pis an equilibrium, then so is f{P). Furthermore, 

if w is an equilibrium, by definition of f, P = f~^{w) is well-defined. Since the best responses of i and 
the vertices hold(j, a) are consistent, P is also an equilibrium. This completes the reduction from F-CSR 
to FSPP, placing F-CSR in PPAD. D 



22 



7.3 PPAD-Hardness 

This section is devoted to the proof of the following theorem. 

Theorem 28. Tlie problem of finding an equilibrium in F-CSR games is PPAD-hard even when the under- 
lying cost function d is a metric. 

Our reduction is from preference games 1.15.1 . Given a preference game G with n players 1, 2, . . . , n 
and their preferences given by >j, we construct an F-CSR game G as follows. The game G has a set 
V of n? + 3n players numbered 1 through n^ + 3n, and a set O of 2n objects ai, . . . , a2n- We set the 
utility function for each node to be the sum utility function, thus ensuring that the desired monotonicity and 
consistency conditions are satisfied. 

We next present the metric cost function d over the nodes. We group the players into four sets Vi = 
{i : 1 < i < n}, V2 = {i • n + j : 1 < i < n,l < j < n}, V3 = {v? -\-n + i: \<i< n}, and 
V4 = {n? + 2n -\- i : 1 < i < n}. For each node i in Vi and j in V3, we set da = 2 and dij = 4. We 
set (i^2_|_„_,_j „2_,_2„_|_j = 3. For each node i in Vi and A; = i • n + j, we set dik as follows: if j >j i then 
dij equals 6 — l/n when j is the Ah most preferred player for i; if i >j j, then dij equals 1. All the other 
distances are obtained by using metric properties. 

We finally specify the object weights. For /c G l/i, we set rk{ai) = 1 for dXli ^ k such that i >k k;we 
set rk{cik) = 2.5 such that 4 < 2rk{cik) < 5. For node A; = i • n + j in V2, we set rk{oi.j) = 1. For node 
A; = n^ + n + i in V3, we set rfc(aj) = rfc(aj+„) = 1. Finally, for node k = n? + 2n + iin V4, we set 

rk{0Li+n) = 1- 

Given a placement P for G, we define a solution a;(P) = {wij} for the preference game G: Wij = 
Pi{aj). The following lemma immediately follows from the definition of G. 

Lemma 29. The following statements hold for any placement P for G. 

• For k = i ■ n + j, 1 < j < n, Pk is a best response to P^k if and only if Pk{aj) = 1. 

• For k = n? -\- n -\- i, 1 < i < n, Pk is a best response to P-k if and only if Pk{a.n+i) = 1- 

• For k = V? -\- n -\- i, Pk is a best response to P^k if and only if Pk{a.i) = 1 — Pi{ai) and Pk{cin+i = 

P:{ai). 

Lemma 30. Let P be a placement for G in which every node not in Vi plays their best response. Then, the 
best response of a node i in Vi is the lexicographically maximum [Pi[aj^).,Pi{aj^), . . . , Pi{aj^)), where 
ji >i J2 >i--->i jn, subject to the constraint that Pi{aj) < Pj{aj) for j ^ i. D 

Proof. Fix a node imVi. By Lemma [29) node i-n + j holds object j, for 1 < j < n; each of these nodes is 



at distance at least 5 and at most 6 away from i. By Lemma 29 for every node A; = n + n + j, 1 < j < n, 
Pkiaj) = 1 - Pjiaj) and Pfc(a„+j) = Pj{aj). 

We now consider the best response of node i. We first note that for any j £ {1, . . . , n} \ {i} such that 
i >i j, Pi{c(j) = since the nearest full copy of aj is nearer than the nearest node holding any fraction 
of object ttj. Let S denote the set of j such that j >i i. For any j in S \ {i}, Pi{aj) < Pj{aj) since 
node n^ + n + j at distance 5 holds 1 — Pj{aj) fraction of aj, the nearest node holding any fraction of 
tti is at distance 4, and 4rj(aj) > 5rj(aj). Furthermore, for any j, A; in S if j >i k, then the farthest 
Pj{aj) fraction of aj is farther than the farthest Pk{ak) fraction of ak, implying that in the best response, if 
Pi{aj) < Pj{aj) then Pi{ak) = 0. Thus, the best response of i is the unique lexicographically maximum 
solution {Pi{aj^),Pi{aj2), ■ ■ ■ ,Pi{aj^)), where ji >i J2 >i ••• >« jn, subject to the constraint that 
Pi{aj) < Pj{aj) for j ^ i. D 
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Lemma 31. A placement P is an equilibrium for G if and only ifu){P) is a equilibrium for G and every 
node not in V\ plays their best response in P. 

Proof. Consider an equilibrium placement P for G Clearly, every node plays their best response. We now 
prove that uj{P) is an equilibrium for G. Fix a node iinVi. By Lemma [30} the best response of i is the 
unique lexicographically maximum solution {Pi{aj-^^) , Pi{aj^) , . . . , Pi{aj^)), where ji >j J2 >i--->i jn, 
subject to the constraint that Pi{aj) < Pj{aj) for j / i. Since this applies to every node i, it is immediate 
from the definitions of uj{P) and preference games that if P is an equilibrium for G then uj{P) is an 
equilibrium for G. 

We now consider the reverse direction. Suppose we have a placement P in which every player not in 



Vi plays their best response and uj{P) is an equilibrium for the preference game G. By Lemma 30 and the 
definition of uj{P), the best response of i in G matches that in the F-CSR game; hence every player in Vi 
also plays their best response in P, implying that P is an equilibrium for G. D 

The construction of G from G is clearly polynomial time. Furthermore, given any equilibrium for G, 
an equilibrium for G can be constructed in linear time. We thus have a reduction from a PPAD-complete 



problem to F-CSR implying that the latter is PPAD-hard, thus completing the proof of Theorem 28 



8 Concluding remarks 

In this paper we have defined integral and fractional selfish replication games (CSR and F-CSR) in net- 
works, where the cache capacity of each node is bounded and all objects are of uniform size. We have 
shown that a pure Nash equilibrium can be computed for every hierarchical network, using a new notion of 
fictional players. In general, we have almost completely characterized the complexity of CSR games: For 
what classes of games do equilibria exist? Can we determine efficiently whether they exist? When they 
do exist, can we efficiently find them? One complexity question that is still open is the case of undirected 
networks with binary preferences. We conjecture that finding equilbria in such games (which we prove 
are potential games) is PLS-hard. In general, we would like to study the convergence of the best response 
process for the cases of games where equilibria exist. 

We showed that F-CSR games always have equilibria, though they may be hard to find. It is not hard to 
argue that an equilibrium in the corresponding integral variant is an equilibrium in the fractional instance. 
So whenever an "integral" equilibrium can be determined efficiently, so can a "fractional" equilibrium. An 
interesting direction of research is to identify other special cases of fractional games where equilibria may 
be efficiently determined. We also note that our proof of existence of equilibria in F-CSR games, currently 
presented for the case of unit-size objects, extends to arbitrary object sizes. 

Finally, we note that our model assumes that the sets of nodes, objects, and preference relations are all 
static. We believe our results will be meaningful for environments where these sets change infrequently. De- 
veloping better models for addressing more dynamic scenarios is an important practical research direction. 
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